Towards Stratification Learning through Homology Inference
نویسندگان
چکیده
A topological approach to stratification learning is developed for point cloud data drawn from a stratified space. Given such data, our objective is to infer which points belong to the same strata. First we define a multi-scale notion of a stratified space, giving stratifications over a set of resolution levels. We then use methods derived from kernel and cokernel persistent homology to cluster the data points into different strata, and we prove a result which guarantees the correctness of our clustering, given certain topological conditions; some geometric intuition for these topological conditions is also provided. Our correctness result is then given a probabilistic flavor: we give bounds on the minimum number of sample points required to infer with high probability which points belong to the same strata. Finally, we give an explicit algorithm for the clustering, prove its correctness, and apply it to some simulated data.
منابع مشابه
Stratification Learning through Homology Inference
We develop a topological approach to stratification learning. Given point cloud data drawn from a stratified space, our objective is to infer which points belong to the same strata. First we define a multi-scale notion of a stratified space, giving a stratification for each radius level. We then use methods derived from kernel and cokernel persistent homology to cluster the data points into dif...
متن کاملSheaf-Theoretic Stratification Learning
In this paper, we investigate a sheaf-theoretic interpretation of stratification learning. Motivated by the work of Alexandroff (1937) and McCord (1978), we aim to redirect efforts in the computational topology of triangulated compact polyhedra to the much more computable realm of sheaves on partially ordered sets. Our main result is the construction of stratification learning algorithms framed...
متن کاملSeparating Features from Noise with Persistence and Statistics
In this thesis, we explore techniques in statistics and persistent homology, which detect features among data sets such as graphs, triangulations and point cloud. We accompany our theorems with algorithms and experiments, to demonstrate their effectiveness in practice. We start with the derivation of graph scan statistics, a measure useful to assess the statistical significance of a subgraph in...
متن کاملThe Prediction of Forming Limit Diagram of Low Carbon Steel Sheets Using Adaptive Fuzzy Inference System Identifier
The paper deals with devising the combination of fuzzy inference systems (FIS) and neural networks called the adaptive network fuzzy inference system (ANFIS) to determine the forming limit diagram (FLD). In this paper, FLDs are determined experimentally for two grades of low carbon steel sheets using out-of-plane (dome) formability test. The effect of different parameters such as work hardening...
متن کاملPerceptions of Iranian EFL Learners towards Learning Language through Computers: An Investigation on the TOEFL (Paper and Computer Based) Test
Computer technology has influenced the realm of language teaching and testing so drastically that no language teaching program could be imagined nowadays without the use digitized software and mul- timedia. This study aims at investigating Iranian EFL learners' performance on paper-based test com- pared with their performance on computer-based test while considering their attitudes towards the ...
متن کامل